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Abstract. We discuss a class of conditionally heteroscedastic time series models 
satisfying the equation r t = Ct&t, where Ct are standardized i.i.d. r.v.’s and the 
conditional standard deviation at is a nonlinear function Q of inhomogeneous 
linear combination of past values r s ,s < t with coefficients bj. The existence of 
stationary solution rt with finite pth moment, 0 < p < oo is obtained under some 
conditions on Q, bj and the pth moment of Co- Weak dependence properties of 
rt are studied, including the invariance principle for partial sums of Lipschitz 
functions of rt . In the case when Q is the square root of a quadratic polynomial, 
we prove that rt can exhibit a leverage effect and long memory, in the sense that 
the squared process has long memory autocorrelation and its normalized 
partial sums process converges to a fractional Brownian motion. 

Keywords: ARCH model, leverage, long memory, Donsker’s invariance princi¬ 
ple 

1 Introduction 

A stationary time series {ry,f € Z} is said conditionally heteroscedastic if its conditional 
variance cr 2 = Var[rt|r s ,s < t] is a non-constant random process. A class of conditionally 
heteroscedastic ARCH-type processes is defined from a standardized i.i.d. sequence {Q. t € 
Z} as solutions of stochastic equation 


r t =(tcrt , a t = V(r s ,s <t), (1.1) 

where V(xi,X 2 , • • •) is some function of aq, X 2 , • • •. 

1 /9 

The ARCH(oo) model corresponds to V(x\,X 2 , ■■■) = (a + ^j x 'f) > or 

OO 

a t = a + ^2 b J r t-j, (!- 2 ) 

3 = 1 
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where a > 0, bj > 0 are coefficients. 

The ARCH(cx)) model includes the well-known ARCH(p) and GARCH(p, q) models of Engle 
m and Bollerslev [6]. However, despite their tremendous success, the GARCH models are 
not able to capture some empirical features of asset returns, in particularly, the asymmetric 
or leverage effect discovered by Black |5j, and the long memory decay in autocorrelation 
of squares {rf}. Giraitis and Surgailis m proved that the squared stationary solution 
of the ARCH(oo) model in (|1.2I) with a > 0 always has short memory, in the sense that 
Cov(rQ, r|) < oo. (However, for integrated ARCH(oo) models with YlJLi bj = 1, bj > 
0 and a = 0 the situation is different; see m-) 

The above shortcomings of the ARCH(oo) model motivated numerous studies proposing 
alternative forms of the conditional variance and the function V(x\, X 2 , ■ ■ ■) in (11.11) . In 
particular, stochastic volatility models can display both long memory and leverage except 
that in their case, the conditional variance is not a function of r s , s < t alone and therefore 
it is more difficult to estimate from real data in comparison with the ARCH models; see 
Shephard and Andersen m- Sentana [30] discussed a class of Quadratic ARCH (QARCH) 
models with of being a general quadratic form in lagged variables r t - 1 , • • • , rt- p . Sentana’s 
specification of of encompasses a variety of ARCH models including the asymmetric ARCH 
model of Engle m and the ‘linear standard deviation’ model of Robinson m ■ The limiting 
case (when p = oo) of the last model is the LARCH model discussed in |15j (see also m, 
a, [in, m) and corresponding to V(x\,X 2 , • • •) = a + bjXj , or 

OO 

a t = a + yf bjr t -j, (1.3) 

j =i 

where a & M, bj £ M are real-valued coefficients. m proved that the squared stationary 
solution {rf} of the LARCH model with bj decaying as j^ 1 , 0 < d < 1/2 may have long 
memory autocorrelations. The leverage effect in the LARCH model was discussed in detail 
in m- On the other hand, volatility at (11.31) of the LARCH model may assume negative 
values, lacking some of the usual volatility interpretation. 

The present paper discusses a class of conditionally heteroscedastic models (11.11) with V 
of the form 

OO 

V(xi,X 2 ,---) = Q( a + bjXj), (1.4) 

3 = 1 

where Q(x),x £ 1 is a (nonlinear) function of a single real variable x € R which may 
be separated from zero by a positive constant: Q(x) > c > 0, x G M. Linear Q(x) = x 
corresponds to the LARCH model (11.31) . Probably, the most interesting nonlinear case of 
Q in (11.41) is 

Q(x) = \/c 2 + x 2 , 

where c > 0 is a parameter. In the latter case, the model is described by equations 
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Note at > c > 0 in (11.51) is nonnegative and separated from 0 if c > 0. Particular cases of 
volatility forms in (11.51) are: 


at = \J(? + (a + br t -i ) 2 (Engle’s [14] asymmetric ARCH(l)), (1.6) 

o-t = c 2 + (a + -^2r t -j) 2 , (1.7) 

\| P j =i 

OO 

a t = |a + y^bj7- t -j,| (Q(x) = |x|), (1.8) 

1=1 

= \Jc 2 + (a + b((l - L)- d - l)r t ) 2 . (1.9) 


In (jl.6l) - (ll.9p . a, 6, c are real parameters, p > 1 an integer, Lxt = xt~\ is the backward 
shift, and (1 — L)~ d xt = Yl < jLo ( Pj x t-j: Vj = r (d + j)/T (d)T (j + l),^o = 1 is the fractional 
integration operator, 0 < d < 1/2. The squared volatility (conditional variance) a/ in 
(ll.6D - Ql.9p and Q1.5D is a quadratic form in lagged returns 77 - 1 , 77 - 2 , • ■ ■ and hence represent 
particular cases of Sentana’s [3Q] Quadratic ARCH (QARCH) model with p = oo. It 
should be noted, however, that the first two conditional moments do not determine the 
unconditional distribution. Particularly, m with Q1.5D generally is a different process 
from Sentana’s m QARCH process, the latter being defined as a solution of a linear 
random-coefficient equation for {rt} in contrast to the nonlinear equation in (II.ID . See also 
Example 2 below. 

Let us describe the main results of this paper. Section 2 obtains sufficient conditions 
on Q,bj and |^| p := E|£o| p for the existence of stationary solution of Q1.1D - Q1.4D with finite 
moment E|?y| p < oo, p > 0. We use the fact that the above equations can be reduced to 
the ‘nonlinear moving-average’ equation 


X t = Y u h-sCsQ{a + x s) 

S<t 

for linear form X t := Y/ S< t ^t-s^s in (|1-4D . and vice-versa. Section 3 aims at providing weak 
dependence properties of with V in Q1.4D . in particular, the invariance principle for 
Lipschitz functions of {rt} and {Xt} , under the assumption that bj are summable and decay 
as j -7 with 7 > 1. Section 4 discusses long memory property of the quadratic model in Q1.5D . 
For bj ~ j —> oo, 0 < d < 1/2 as in Q1.9I) . we prove that the squared process { r/} has 

long memory autocorrelations and its normalized partial sums process tend to a fractional 
Brownian motion with Hurst parameter H = d + 1/2 (Theorem HOP . Finally Section 5 
establishes the leverage effect in spirit of HE, viz., the fact that the ‘leverage function’ 
hj := Cov(of, rt-j), j > 1 of model (1 1.5 j) takes negative values provided the coefficients a 
and bj have opposite signs. All proofs are collected in Section 6 (Appendix). 

Notation. In what follows, C,C(...) denote generic constants, possibly dependent on 
the variables in brackets, which may be different at different locations, at ~ bt(t —>• oo) is 
equivalent to li m t ^oo a t/bt = 1. 
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2 Stationary solution 


This section discusses the existence of a stationary solution of (11.11) with V of (11.dl) . viz., 

n = C tQ(a + y; b t - 8 r 3 ), ieZ. (2.10) 

s<t 

Denote 

X t ■= 22bt-sr s . (2.11) 

S<t 

Then r* in ( 12.101) can be written as rt = CtQ(a + Xt) where (12.111) formally satisfies the 
following equation: 

Xt = 22 bt-sCsQ{a + X s ). (2-12) 

S<t 

Below we give rigorous definitions of solutions of (12.101) and ()2.121) and a statement (Propo¬ 
sition [3]) justifying (12.121) and the equivalence of (12.101) and ()2.121) . 

In this section we consider a general case of (12.10[) - (12.12f) when the innovations may have 
infinite variance. More precisely, we assume that E Z} are i.i.d. r.v.’s with finite 

moment \p\ p := E|^| p < oo, p > 0. We use the following moment inequality. 


Proposition 1 Let { Yj,j > 1} be a sequence of r.v. ’s such that E|Y)j p < oo for some p > 0 
and the sum on the r.h.s. of (12.131) converges. If p > 1 we additionally assume that {Yj} 
is a martingale difference sequence: E[Y)|Yi, • • • , Y)_i] = 0, j = 2,3, • • • . Then there exists 
a constant K p depending only on p and such that 


OO 




E,“,Era», 


0 < p < 2, 
p > 2. 


(2.13) 


Remark 1 In the sequel, we shall refer to K p in ()2.131) as the Rosenthal constant. For 
0 < p < 1 and p = 2, inequality (j2.131) holds with K p = 1, and for 1 < p < 2, it is known as 
von Bahr and Esseen inequality, see [33], which holds with K p = 2. For p > 2, inequality 
()2.131) is a consequence of the Burkholder and Rosenthal inequality (see [7], [22], also [IS], 
Lemma 2.5.2). Os§kowski [261 proved that K]J V < (| + 1) 1 / p (l + log ^ 2 ) )’ ' n 

particular, K 2^ < 27.083. See also 23!. 


Let us give some formal definitions. Let J~t = cr(C s , s < t), t G Z be the sigma-field gener¬ 
ated by ( S ,s <t. A random process {ut,t E Z} is called adapted (respectively, predictable ) 
if ut is Tj-measurable for each t E Z (respectively, ut is J-f_i-measurable for each ( E Z). 
Define 


o < p < 2, 

p>2. 


(2.14) 


Definition 2 Let p > 0 be arbitrary. 

(i) By IP-solution of (12.101) we mean an adapted process {rt,t E Z} with E|r f | p < oo such 
that for any t E Z f/ie series ^2 s<t bt- s r s converges in L p and (12.101) holds. 
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(ii) By L p -solution of (12.121) we mean an predictable process {X t ,t G Z} with E|X 4 | P < oo 
such that for any t G Z t/ie series Y2 s< t b t _ s ( s Q(a + X s ) converges in L p and (12.12[) holds. 

Let Q(x),x G M be a Lipschitz function, i.e., there exists Lipg > 0 such that 

\Q(x) - Q{y)\ <Up Q \x - y\, x,yeR. (2.15) 

Note (|2.15l) implies the bound 

Q 2 {x) < c 2 + c 2 x 2 , x G R, (2-16) 

where c\ > 0, C 2 > Lipg and C 2 can be chosen arbitrarily close to Lipg; in particular, (12.161) 
holds with c 2 = (1 + e 2 )Lipg, c 2 = Q 2 ( 0)(1 + e~ 2 ), where e > 0 is arbitrarily small. 

Proposition 3 Let Q be a measurable function satisfying (12.161) with some ci,C 2 > 0 and 
{C*} be an i.i.d. sequence with \gt\ p = E|£o| p < oo and satisfying E£o = 0 for p > 1. In 
addition, assume B p < oo. 

(i) Let {Xt} be a stationary L p -solution of (12.121) . Then {rt '■= CtQ(o,-\-Xt)} is a stationary 
L p -solution of (12.10|) and 

E \r t \ p < C(l + E\X t \P). (2.17) 

Moreover, for p > 1, {rt, Xt,t G Z} is a martingale difference sequence with 

E[r t |^ t _i] = 0, E[\r t \ p \X t -i] = M P |Q(a + ^ b t - s r s ) \ p . (2.18) 

S<t 

(ii) Let {rt} be a stationary L p -solution of (12.101) . Then {Xt} in (12.lip is a stationary 
LP-solution of (12.121) such that 

E\X t \ p < CE\r t \ p . (2.19) 

Moreover, for p > 2 

OO 

E[X t Xo] = E rlY^bt+sbs, t = 0,1,.... (2.20) 

S=1 

Remark 2 Let p > 2 and \p\ p < oo, then by inequality (12.131) . {r t } being a stationary 
L p -solution of (12.101) is equivalent to {r t } being a stationary L 2 -solution of (12.101) with 
E|ro| p < oo. Similarly, if Q and {Ct} satisfy the conditions of Proposition [3] and p > 2, 
then {Xt} being a stationary L p -solution of (12.12|) is equivalent to {Xt} being a stationary 
L 2 -solution of (12.121) with E|Xo| p < oo. 

The following theorem obtains a sufficient condition in (12.211) for the existence of a station¬ 
ary L p -solution of equations (12.lOj) and (12.121) . Condition (12.211) involves the pth moment of 
innovations, the Lipschitz constant Lipg, the sum B p in (12.141) and the Rosenthal constant 
K p in (12.131) . Part (ii) of Theorem [4] shows that for p = 2, condition (12.211) is close to 
optimal, being necessary in the case of quadratic Q 2 (x ) = c 2 + c 2 x 2 . 
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Theorem 4 Let the conditions of Proposition\^be satisfied, p > 0 is arbitrary. In addition, 
assume that Q satisfies the Lipschitz condition in (|2.15l) . 


(i) Let 

K p \p\ p Up p Q B p < 1. (2.21) 

Then there exists a unique stationary L p -solution {Xt} of (12.121) and 


E\X t \ p < 


C{p,Q)\ii\ p B p 

1 - K p \p\pLvp p Q B p ' 


( 2 . 22 ) 


where C(p,Q) < oo depends only on p and c\,ci in (12.16|) . 

(ii) Assume, in addition, that Q 2 (x) = c 2 + c 2 x 2 , where Ci > 0,i = 1,2, and p ,2 = E£g = 1. 
Then cf B 2 < 1 is a necessary and sufficient condition for the existence of a stationary 
L 2 -solution {X t } of ( 12 . 12 |) with a 0 . 


Remark 3 Condition (|2.21l ) agrees with the contraction condition for the operator defined 
by the r.h.s. of (|2.12l) and acting in a suitable space of predictable processes with values 
in L p . For the LARCH model, explicit conditions for finiteness of the pth moment were 
obtained in m, m using a specific diagram approach for multiple Volterra series. For 
larger values of p > 2, condition (12.211) is preferable to the corresponding condition 

(2 p -p-l) l / 2 \p\ l J p B l p / p <1, p = 2,4,6,--- , (2.23) 

in (US], (2.12)) for the LARCH model, since the coefficient (2 P — p— l) 1 / 2 grows exponentially 
with p in contrast to the bound on K p - r> in Remark |T] See also m, sec. 4.3). On 
the other hand for p = 4 (12.231) becomes y/\A\ii^J^B^ 2 < 1 while ()2.21l) is satisfied if 
K 1 By 2 < 27.083|/i|y i I?2 // " < lj see Remark[T] which is worse than (I2.23p . 

Example 1 (The LARCH model) Let Q(x) = x and {Ct} be a standardized i.i.d. se¬ 
quence with zero mean and unit variance. Then (12.12|1 becomes the bilinear equation 

X t = Y^ b tsUa + X s ). (2.24) 

S<t 

The corresponding conditionally heteroscedastic process {r t = (i(a+A t )} in Proposition^) 
is the LARCH model discussed in [IS], [FT] and elsewhere. As shown in ([IS], Thm.2.1), 
equation (|2.24l) admits a covariance stationary predictable solution if and only if B 2 = 
YlpLi < 1. Note the last result agrees with Theorem |4] (ii). A crucial role in the study 
of the LARCH model is played by the fact that its solution can be written in terms of the 
convergent orthogonal Volterra series 

OO 

Xt = yi bt— si ■ ■ ■ bsk-i-skCsi • • • Csfc- 

k= 1 s k <---<si<t 

Except for Q(x) = x, in other cases of (|2.121) including the QARCH model in (11.51) . Volterra 
series expansions are unknown and their usefulness is doubtful. 
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Example 2 (Asymmetric ARCH(l)) Consider the model (11.11) with a t in (11.61) . viz. 

n = Ct(c 2 + (a + 6r t _i) 2 ) 1/2 , (2.25) 

where {Ct} are standardized i.i.d. r.v.’s. By Theorem [4] (ii), equation (12.251) has a unique 
stationary solution with finite variance Er 2 = (a 2 + c 2 )/(l — b 2 ) if and only if b 2 < 1. 

In parallel, consider the random-coefficient AR(1) equation 


r t = K£ t + b'qtJt- 1 , (2.26) 

where {(e^r^)} are i-i.d. random vectors with zero mean Ee* = E rjt = 0 and unit variances 
E[e 2 ] = E [r] 2 ] = 1 and k, b are real coefficients. As shown in Sentana j30] (see also Surgailis 
[321 ), equation (12.261) has a stationary solution with finite variance under the same condition 
b 2 < 1 as (12.251) . Moreover, if the coefficients k and p := E [strjt] E [—1,1] in (12.261) are related 
to the coefficients a, c in (|2.25l) as 

up = a, k 2 = a 2 + c 2 , (2.27) 

then the processes in (12.251) and (|2.26l) have the same volatility forms since 

a 2 := E[rf|r s ,s<t] = k 2 + 2Kbp + b 2 rf_ 1 

= c 2 + (a + br t ~ i) 2 

agrees with the corresponding expression a 2 = c 2 + (a + brt- 1 ) 2 in the case of (11.61) . 

A natural question is whether the above stationary solutions {rt} and {rt} of (|2.25l) 
and (I2.26p . with parameters related as in (I2.27p . have the same (unconditional) finite¬ 
dimensional distributions? As shown in ([32], Corollary 2.1), the answer is positive in the 
case when {^t} and {(st,rjt)} are Gaussian sequences. However, the conditionally Gaussian 
case seems to be the only exception and in general the processes {r^} and {rt} have different 
distributions. This can be seen by considering the 3rd conditional moment of ([2.251) 

E[r t 3 |r t _i] = /i 3 (c 2 + (a + 6r t _i) 2 ) 3/2 (2.28) 

which is an irrational function of rt —i (unless = E£q = 0 or b = 0), while a similar 
moment of ([2.261) 

E[rt\r t -i] = k 3 i^ 3 ,o + 36K 2 i/ 2 ,in_i + 36 2 kzz 1i2 ?|_ 1 + 6 3 z/ 0i3 rf_i (2.29) 

is a cubic polynomial in rt— i, where Vi.j := E[cq'/ 7 q]. Moreover, ([2.281) has a constant sign 
independent of rt-\ while the sign of the cubic polynomial in (I2.29P changes with rt ranging 
from oo to —oo if the leading coefficient b 3 u o i3 ^ 0. 

Using the last observation we can prove that the bivariate distributions of (rt,rt~ i) and 
(rt,rt— i) are different under general conditions on the innovations and the parameters of 
the two equations. The argument is as follows. Let b > 0, c > 0, /r 3 > 0, i^o ,3 = Er/g > 0. 
Assume that Co has a bounded strictly positive density function 0 < f(x) < C, x € M and 


7 






























(£ 0 ]%) has a bounded strictly positive density function 0 < g(x,y) < C, (x,y) € M 2 . The 
above assumptions imply that the distributions of ry and r t have infinite support. Indeed, 
by (12.2511 and the above assumptions we have that P(rt > K) = J K P(c 2 + (a + br t - 1 ) 2 > 
( K/y) 2 )f(y)dy > 0 for any K > 0 since lim^oo P(c 2 + (a + br t - 1 ) 2 > ( K/y) 2 ) = 1. 

Similarly, P(r t > K) = J r2 P(r*_i > (K — nx)/by)g(x, y)dxdy > 0 and P(r* < —K) > 

0,P(fi < —K) > 0 for any K > 0. Since h(x) := p 3 (c 2 + (a + bx ) 2 ) 3 ^ 2 > 1 for all \x\ > K 
and any sufficiently large K > 0, from (12.28p we obtain that for any K > 0 

Erfl(rt-\ > K) = Eh(rt-i)l(rt-i > K) > 0 and 
Erf l(rt_i < —K) = E/i(r t _i)l(r t _i < -K) > 0. (2.30) 

On the other hand, since h(x) := k 3 v 3 fi+3bK 2 v 2 ^x-\-3b 2 KUi y2 x 2 +b 3 vq )3 x 3 > 1 for x > K and 
h(x) < — 1 for x < —K and K large enough, from (12.291) we obtain that for all sufficiently 
large K > 0 

Erfl(rt_i > K ) = Eh(rt-i)l(rt~i > K) > 0 and 
Erfl(ft_i < -K) = Eh(r t -i)l(r t -i < -K) < 0. (2.31) 

Clearly, (12.301) and (12.311) imply that the bivariate distributions of 1 ) and (Ft,?i_ 1 ) 

are different under the stated assumptions. 

For models (|2.25l) and (12.261) . we can explicitly compute covariances p(t) = cov(rf, rg), p(t) 
cov(rf ,?§) and some other joint moment functions, as follows. 

Let p 3 = ECf = 0 ,im = E^g < oo and m 2 := Erg, 7773 (f) : = Erfrg, 7774 (f) := Er 2 r^,t > 0. 
Then 


m 2 = (a 2 + c 2 )/(1 — ft 2 ), m 3 (0) = 0, 

771 , 3 ( 1 ) = E[((a 2 + c 2 ) + 2 abr 0 + & 2 rg)r 0 ] = 2 abm 2 + b 2 m 3 (0) 

m 3 (t) = E[((a 2 + c 2 ) + 2abr t -\ + 6 2 rf_ 1 )r 0 ] = b 2 m 3 (t- 1) 

= M(L±£V-», t >i. 

1 — b 2 

Similarly, 

m 4 ( 0 ) = / 74 E[((a 2 + c 2 ) + 2a6r_i + 6 2 r 2 x ) 2 ] 

= /U 4 {(a 2 + c 2 ) 2 + ( 2ab) 2 m 2 + b^m^O) + 2 b 2 (a 2 + c 2 )m 2 }, 

7714 (f) = E[((a 2 + c 2 ) + 2abr t -\ + feV^^rg] = (° 2 + c 2 )m 2 + b 2 m±{t — 1), t > 1 


= 2abm 2 , 

= ... = ft^-Dmsa) 

(2.32) 


resulting in 


and 


7774(0) 


7774 (f) 


/U 4 ((a 2 + c 2 ) 2 + ((2a&) 2 + 2(a 2 + c 2 )b 2 )m 2 ) 

1 — /U4& 4 
1 — 6 2t 

m 2 (a 2 + c 2 )-- - -pr + 6 2i m 4 (0), t > 1, 

1 — 


(2.33) 


p(f) = ( 7774 ( 0 ) — m 2 )b 2t , t > 0. 


(2.34) 
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In a similar way, when the distribution of £o is symmetric one can write recursive linear 
equations for joint even moments E[r 2p (0)r 2p (f)] of arbitrary order p = 1,2,... involving 
E[r 2i (0)r 2p (f)], 1 < l < p — 1 and 474 2 fc(0) = E[r 2fc (0)], 1 < k < 2p. These equations can be 
explicitly solved in terms of a, b, c and /i 2 fc, 1 < k < 2p. 

A similar approach can be applied to find joint moments of the random-coefficient AR(1) 
process in (|2.26lh with the difference that symmetry of (eo, Vo) is not needed. Let m 2 := Erf, 
fh 3 (t) : = E[rfr 0 ], 4444 (f) := E[rfrf] and p(t) : = Cov(rf,rf), v i}i := E[e l 0 Vo\- Then 


m2 = k 2 /(1 - 6 2 ), 

m 3 (0) = E[(ree 0 + bpor-i) 3 } = k 3 z/ 3j0 + 3K& 2 77i i2 m 2 + b 3 v 0t3 m 3 (0), 
m 3 (l) = E[(k + 2npbr 0 + b 2 r%)r 0 ] = 2npbm2 + 6 2 m 3 (0), 

4743(f) = E[(k 2 -)- 2npbr t _i + b 2 ? 2 ^)? 0 ] = b 2 fh 3 (t - 1) = ■ ■ ■ = b 2< ' t ~ 1 ' > m 3 ( 1), t > 2, 
and 


771 . 4 ( 0 ) = E[(k £ 0 + 677 0 F_i) 4 ] 

= k 4 ^4,o + 6 K 2 b 2 V 2 , 2 rn 2 + 4«;6 3 ^i )3 m 3 (0) + 6 4 770,44744 (0), 

m 4 (l) = E[(K£ t + brjtro) 2 r%) = « 2 m 2 + 2K,pbm 3 (0) + fe 2 m 4 ( 0 ) 

4744 (f) = E[(K£ t + bptn-ifrl] = K 2 m 2 + 6 2 m 4 (t - 1), t > 2, 

k 3 ^ 3 ,o + 3 K 6 2 i/i j 2 m 2 
1 - ^ 0 , 3& 3 

b 2tyt ~ 1 \2Kpbm2 + b 2 fh 3 ( 0 )) f > 1 . 

K 4 I/ 4j 0 + 6 k 2 6 2 1^2,27772 + 4k6 3 I/i 3 777 3 (0) 

-i-n-* ( 2 - 35 ) 

1 - 7^0,4ft 4 

/1 _ 5 2 * x 

777 2 tc 2 —p-j + fe 2 t (m 4 ( 0 ) + 2npm 3 (0)/b), t> 1 , 

Pi{t) = 6 2(t_1) p 4 (l), t > 1, 

f?4 ( 1 ) = 2pnbfh 3 (0) + 6 2 (777,4(0) — TTif). 

Then if Z 7 3 0 = i / 12 = 0 we have 477,3(0) = 0 and /74(f) = (447,4(0) — fh 2 )b 2t ; moreover, m 2 = m 2 
in view of (| 2 . 27 l) . Then /74(f) = /74(f) is equivalent to 4744(0) = 7744(0), which follows from 


leading to 

774 3 (0) = 

"4 3 (f) = 

4744 ( 0 ) = 
7744 (f) = 

and 


//4 = 770,4 = 774,0 and 6i7 2 , 2 = /44(4z7 2 l + 2), (2.36) 

see (12.341) . (|2.33l) . (I2.35p . Note that (12.3611 hold for centered Gaussian distribution (eo ; 7/o) 
with unit variances Eef = E/ 7 2 = 1. 
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3 Weak dependence 


Various measures of weak dependence for stationary processes {y t } = {yt,t G Z} have been 
introduced in the literature, see e.g. [9]. Usually, the dependence between the present (t > 0) 
and the past (t < — n ) values of {yt} is measured by some dependence coefficients decaying 
to 0 as n —>• oo. The decay rate of these coefficients plays a crucial role in establishing many 
asymptotic results. The classical problem is proving Donsker’s invariance principle: 

1 [nr] 

—= y— E y t ) —> ctB(t), in the Skorohod space D[ 0,1], (3.37) 

where B = {B(t),t G [0,1]} is a standard Brownian motion. The above result is useful 
in change-point analysis (Csorgo and Horvath [8]), financial mathematics and many other 
areas. Further applications of weak dependence coefficients include empirical processes 
m and the asymptotic behavior of various statistics, including the maximum likelihood 
estimators. See Ibragimov and Linnik [24] and the application to GARCH estimation in 

m- 

The present Section discusses two measures of weak dependence - the projective weak 
dependence coefficients of Wu |35j and the r-dependence coefficients introduced in Dedecker 
and Prieur m, m - for stationary solutions {rt},{X t } of equations (12.101) . (|2.12jl . We 
show that the decay rate of the above weak dependence coefficients is determined by the 
decay rate of the moving average coefficients bj. 

3.1 Projective weak dependence coefficients 

Let us introduce some notation. For r.v. £, write ||£|| p := E 1/,p |£| p , p > 1. Let {yt,t G Z} 
be a stationary causal Bernoulli shift in i.i.d. sequence {Ct}> i n other words, 

Vt = f(Cs,s < t), fGZ, 

where / : M N -> 1 is a measurable function. We also assume Eyo = 0, ||yo||! = ^Vo < 00 • 
Introduce the projective weak dependence coefficients 

uJ 2 (i',{yt}) ■= ||/i(£o) - fi(£o)h, 5 2 (i;{y t }) := ||/(Ci) — /(CDlb, (3.38) 

where & := (• ■ ■ , C-i, Co, Cl,''' Xi), & ■= (' • • , C-i, Co> Cl,''' ,C i), fi(£ o) := E[/(&)l£o] = 
E[yj|Tfj] is the conditional expectation and (CcuCtU G are i.i.d. r.v.s. Note the i.i.d. 
sequences £ and £' coincide except for a single entry. Then U 2 {i;{yt}) < 62 ( 1 ', {yt}),i > 0 
and condition 

OO 

y^u 2 (k]{y t }) <00 (3.39) 

k =0 

guarantees the weak invariance principle in (13.371) with a 2 := J2jezCov(yo,yj), see Wu 
[351 . The last series absolutely converges in view of (13.391) and the bound in ([35], Thm. 1), 
implying |Cov(y 0 , Vj)\ < Efelo {y t })u 2 (k + j; {y t }), j > 0. 
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Below, we verify Wu’s condition (13.391) for {X t },{r t } in (|2.12l) . (12.101) . We assume that the 
coefficients bj decay as j -7 with some 7 > 1 , viz., 

37 > 0, c > 0 : \bj\ < cj -1 , V j > 1. (3.40) 

Proposition 5 Let Q satisfy the Lipschitz condition in (12.151) . p > 1, K p \p\pLvp P QB p < 1 
(see (12.211) 1. and {X t },{r t } be stationary L p -solutions of (12.121) . (12.101) . respectively. In 
addition, assume that bj satisfy (13.401) with 7 > max{l/2,1/p}. 

Then 

5 p (k;{X t }) = 0(k^) and 5 p {k-{r t }) = 0{k^). (3.41) 

The next corollary follows from the above-mentioned result of Wu m, relations 62 {k\{yt}) 
< C6 2 (k-{r t }), d 2 (k-{zt}) < C5 2 (k-{x t }) and 


Corollary 6 Let {yt := h(r t )},{zt := h(X t )}, where {X t },{r t } are as in Proposition^ and 
h : K —>• R is a Lipschitz function. Then 


[tit] 

n-^J2(y t -Ey t ) -^d[ 0i1 ] c y B{j) and 
t =1 


[nr] 

n -i/2 ^( Zt _ E zt) ->r>[ 0 ,i] c z B(t), 
t =1 


(3.42) 


where B is a standard Brownian motion and 

4 := ^2 Cov (yoiyt) < 00 , 
t&L 


c 2 z := ^2 Cov( 2 0 , z t ) < 00 . 

fgZ 


3.2 r-weak dependence coefficients 

Let {yt,t € Z} be a stationary process with ||yo||p < 00 , p E [l,oo]. Following Dedecker and 
Prieur (10] . im, we define the r-weak dependence coefficients 


T p({yji}i<i<k) ■= sup \E[f(y jir -- ,y jk )\y t ,t< 0] - E[/( yjl , • • • ,y jk )]\ 
fe Ai(Kfc) 


measuring the dependence between {yt}t< 0 and {%,}i<j<fc; 0 < j\ < ■ ■ ■ < j k , and 


T p{ n \{yj}) : = sup k 1 sup TpdyjJ i<i<fe). 

fc>l n<ji<—<jk 

Here, Ai(K fc ) denotes the class of all Lipschitz functions / : —>• M with 

k 

|/(xi,- • • ,x k )~ f(yi,- • • ,yfe)| < |xj - Pi I for any (xi, • • • ,x k ), {yi,-- ■ ,yk) € M fc . 

2—1 

Proposition 7 Let the conditions of Theorem^i) be satisfied, p > 1, and let {Xt},{rt} be 
stationary L p -solutions of (12.121) . (12. 10|) . respectively. In addition, assume that bj satisfy 
(13.401) with 7 > 1. Then 


T P (n;{Xj}) = 0(n 7+1 ), T p (n; {rj}) = 0(n 7+1 ). (3.43) 
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(j 112;. Thm 1) together with Proposition[7]imply the following CLT for the empirical distribu¬ 
tion functions F x {u) := n^ 1 Ylt= 1 1PQ — u )i Ff)(u) := n _1 ^” =1 l(r t < u), u € R of sta¬ 
tionary solutions {Xi},{rt} of (|2.12p . (12.101) . Let F x (u) := P(Vo < u),F r (u ) := P(ro < it) 
be the corresponding distribution functions. See m for the definition of weak convergence 
in the space of all bounded functions on M. 

Corollary 8 Let the conditions of Proposition [?] hold with p = 1 and 7 > 5. Moreover, 
assume that F x ,F r have bounded densities. Then { \/n(F, x (■ u ) — F x {u)),u € M} and 
{y/n(F^(u) — F r (u)),u € M} converge weakly in as n —>• 00 towards Gaussian pro¬ 

cesses on M with zero mean and respective covariance functions 

Cov(l(Xo < u), 1 (Xk < u)), and Cov(l(ro < it), l(rfc < it)). 

fcgZ fcgZ 

Remark 4 In |T2] a general tightness condition is proposed in Proposition 6 for alternative 
classes of functions besides indicators of half-lines. Conditions are not immediate to check 
which explains that we restricted to the case of empirical cdf. 

4 Strong dependence 

The term strong dependence or long memory usually refers to stationary process {yt,t £ Z} 
whose covariance decays slowly with the lag so that its absolute series diverges: YlkLi 
\Cov(yo,yk)\ = 00 . Since the variance of Ylk=i Dt usually grows faster than n under long 
memory, Donsker’s invariance principle in (13.371) is no more valid and the limit of the partial 
sums process, if exists, might be quite complicated. Probably, the most important model of 
long memory processes is the linear, or moving average, process yt = Yls<t bt-sCs, where {Cs} 
is an i.i.d. sequence with zero mean and finite variance, and the moving-average coefficients 
bj decay as in (|4.44l) below. Various generalizations of the linear model were studied in 
El, m and other works. See the monograph m for a discussion and applications of long 
memory processes. 

It is natural to expect that the ‘long memory’ asymptotics of bj in (|4.44l) induces some 
kind of long memory of solutions {ry},{Vt} of (11.11) . (|2.12l) . under general assumptions on 
Q. Concerning the latter process, this is true indeed as shown in the following theorem. 

Theorem 9 Let {Xf} be a stationary L 2 -solution of ()2. 12j) . where 

bj ~ /3/ _1 (3 0 < d < 1/2, p > 0) (4.44) 

and Q satisfies the Lipschitz condition in (|2.15l) with Lip qB 2 = Lip^ bj < 1. Then 

Cov(Xo,X t ) ~ A \t 2d ~ l , M 00 and (4.45) 

[nr] 

n d (1/2) A 2 Rrf_|_(i/2)(T), 

t=l 

where R<i+(i/ 2 ) Is a fractional Brownian motion with Xav(B d+ ^i/ 2 ) (t)) = r 2d+1 and Aj := 
p 2 B(d, 1 - 2d)EQ 2 (a + V 0 ), \ 2 : = \\/d{l + 2d). 
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Clearly, properties as in (14.451) do not hold for {r t = C,tQ{a+X t )} which is an uncorrelated 
martingale difference sequence. Here, long memory should appear in the behavior of the 
volatility at = Q(a + Xt), being ‘hidden’ inside of nonlinear kernel Q. The last fact makes 
it much harder to prove it rigorously. In the rest of the paper we restrict ourselves to the 
quadratic model with Q 2 (x) = c 2 + x 2 , or 

n = ( t Jc 2 + (a + '^2b t - s r s S ) , (4.46) 

V s<t 

as in (11.51) . where (recall) {Q} are standardized i.i.d. r.v.s, with zero mean and unit variance, 
and bj,j > 1 are real numbers satisfying (14.441) . 

The following theorem shows that under some additional conditions the squared process 
{r 2 } of (|4.46l) has similar long memory properties as {Xt} in Theorem [9j For the LARCH 
model (see Example 1 above) similar results were obtained in ([15], Thm. 2.2, 2.3). 

Theorem 10 Let {rt} be a stationary L 2 -solution of (I4.46P with bj satisfying (14.441) and 
B 2 = b 2 < 1 . Assume in addition that /i 4 = E[Cq] < oo and Erf < oo. Then 

Cov(r-Q,r 2 ) ~ K 2 f 2d-1 , t —y oo (4.47) 

where k{ := { f‘ff^- i ) 2 B{d, 1 — 2d)Erg. Moreover, 

[nr] 

n -d-i /2 ^2(r? - Erf) ^d[o,i] K 2 H d+( 1 / 2 )(r), n ^ oo, (4.48) 

t =l 

where / 2 ) is a fractional Brownian motion as in ([4.451) and k 2 := K 2 /(d( 1 + 2d)). 

Remark 5 Recently, Grublyte and Skarnulis m extended Theorem m and some other 
results of this paper to a more general QARCH model: 

r t = (tvt, cr l = c 2 + (a + ^btsTs) 2 +^af_ l , f G Z (4.49) 

S<t 

involving lagged variable cr 2 _ 1 . For parametric ‘long memory’ coefficients bj = Pj d ~ 1 as 
in ([4.441) Grublyte et al. [22] discussed quasi-maximum likelihood (QML) estimation of 
parameters a, /3, c, d , 7 from observed sample rt, 1 < t < n satisfying the equations in (I4.49p . 
and proved consistency and asymptotic normality of the QML estimates. Related results 
for parametric LARCH model with long memory were obtained in Beran and Schiitzner [3]. 

5 Leverage 

Given a stationary conditionally heteroscedastic time series {rt} with E|?y | 3 < 00 and 
conditional variance a 2 = Var(r 2 | r s ,s < t), leverage (a tendency of a 2 to move into the 
opposite direction as r s for s < t) is usually measured by the covariance h t - s = Cov(<r 2 , r s ). 
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Following m , we say that {r t } has leverage of order k (1 < k < oo) (denoted by {r 4 } £ £(k)) 
whenever 

hj <0, 1 <j< k. (5.50) 

Note for {r t } in (II.111 . 

h j =E lrj r o\, j = 0,1,... (5.51) 

is the mixed moment function. Below, we show that in the case of the quadratic of in (11.51) . 
viz., 

OO 

n = Ctcrt, CT t = (c 2 + (a + ^2 h j r t-j ) 2 ) l/2 (5.52) 

3 = 1 

and /r 3 = E[Cq] = 0, the function hj in (I5.5ip satisfies a linear equation in (I6.83p . below, 
which can be analyzed and the leverage effect for {rt} in (I5.52P established in spirit of (17] . 
Let L 2 (Z_|_) be the Hilbert space of all real sequences if = ( ifj,j £ Z + ), := {1, 2, • • • } with 

finite norm ||?/|| := (Xqt=i V’f) 1 / 2 < oo. As in the previous sections, let B := (J2JLib‘j) 1 ^ 
and assume that {<^} is an i.i.d. sequence with zero mean and unit variance; /it := EQj, i = 
1, 2, ■ ■ ■ • 

The following theorem establishes a criterion for the presence or absence of leverage in 
model (I5.52|) . analogous to the Thin. 2.4 in | T7] . 

We also note that the proof of Theorem [TT] is simpler than that of the above mentioned 
theorem, partly because of the assumption ^ = 0 used in the derivation of equation (16.831) . 

Theorem 11 Let {r t } be a stationary L 2 -solution of (14.461) with E|r*o| 3 < ooj^ls < oo. 
Assume in addition that B 2 < 1/5 and ^ = E(q = 0. Then for any fixed k such that 
1 < k < oo: 

(i) if ab\ < 0, abj < 0, j = 2, ■ • ■ , k, then {r t } £ £{k) 

(ii) if ab\ > 0, abj > 0, j = 2, ■ ■ ■ , k, then hj > 0, j = 1, • • • , k. 

Particularly, for the Asymmetric ARCH(l) in (|2.25l) with E|ro| 3 < oo,E£q = 0 the 
leverage function is hj = 2 m2ab 2 i~ 1 , see (12.321) . and {r t } £ £{k) is equivalent to ab < 0. 
Apparently, condition B 2 < 1/5 is not necessary for the statement of Theorem IIII although 
a similar condition appears in the study of the leverage effect in am, (si)). 

6 Proofs 

Proof of Proposition}^} (i) Since {X t } is predictable and Q satisfies ()2.16l) so 

E|r t |P = \fj,\ p E\Q(a + X t )\ p 

< \yi\ p E\cl + ^{a + X t )Y 2 

< C(1 + E|WH <C<oo, 
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proving (12.171) . Moreover, if p > 1 then E[r>,|J^-i] = 0 is a stationary martingale difference 
sequence. Hence by Proposition [1] the series in (12.111) converges in L p and satisfies 


E\X t \P < 



Ef.i N”. 

(E“ 1 6 ?)’’ /2 . 


0 < p < 2 
p> 2 


< oo. 


In particular, (tQ(a + ^2 s <t b t-s p s) = CtQ{a + X t ) = r t by the definition of r t . Hence, {r t } 
is a L p -solution of (12.101) . Stationarity of {r^} follows from stationarity of {Xt}. 

Relations ()2.18[) follow from E^lJ^-i] = 0, E[|£t| p | = \p\ p , p > 1, and the fact that 

X t is Xt- 1 -measurable. 

(ii) Since {r t } is a L p -solution of (I2.10p . so r t = (tQ(a + X t ) with X t defined in (12.111) . and 
{Xt} satisfy (|2.12l) . where the series converges in L p . Also note that {Xt} is predictable. 
Hence, {X t } is a Resolution of (12.12[) . By ()2.16[) . 


E\r t \P = \p\ p E\Q(a + X t )\ p < \p\ p E\c 2 + c 2 (a + X t ) 2 \ p / 2 < C(l+E\X t \ p ) < C. 


It also readily follows that, for p > 1, {r t ,Xt,t € Z} is a martingale difference sequence. 
Hence, by the moment inequality in (12.131) . 


E\X t \ p < 



E~i \ b j\ P E\r t -j\ p , 0 < p < 2 

{ET=i b p 2 /P \rt-j\ P Y /2 , P> 2 


CB p E\r t \ p , (6.53) 


proving (|2.19l) . Stationarity of {X t } and (12.201) are easy consequences of the above facts 
and stationarity of {r t }. □ 


Proof of Theorem [/} (i) For n € N define a solution of (|2. 121) with zero initial condition at 
t < —n as 




(n) 


0, t < —n, 

)Cs=-n b t -sCsQ(a + X s (n) ), t > -n, t € Z. 


(6.54) 


Let us show that {xj r > } converges in L p to a stationary Resolution {X t } as n —>• oo. First, 
let 0 < p < 2. Let m > n> 0. Then by inequality (|2.131) for any t > —m we have that 


E|4” ,) -J£, ( ”Y = A>| p { ^ |6,- s | ! ’E| ( J(a + A<’">)|» 

—m<s<—n 

+ ]T |6 t _,| p E|Q(a + iW) - Q(a + x( m ))| p } 

—n<s<t 

= : X p M P {^ n + R", n }. 

Let x P (n) := E£L n N P 

From the bound |a + x\ 2 < (2a 2 /e) + (1 + e)x 2 , valid for any 0 < e < 1/2, and a > 0, 

it follows that 


cj+c 2 2 (a + X M) 2 )| p/2 < <* +(P\( a + X^) 2 \ p / 2 

< C( Cl ,c 2 ) + ^(l + e) p / 2 \X^\P 

< C( ci ,c 2 ) + 4\X^\ P , 
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with C 3 > C 2 > Lipg arbitrarily close to Lip q. Then using (12.161) we obtain 


S'm,n < E \bt-s\ p V\cl+c 2 2 (a + X^) 2 \ p 
—m<s<—n 

< C(Q)K p \n\p X p(t + n) + C 3 E \bt-s\ p m!i m) -XW\r, 

—m<s<—n 

s'L, n < Lip' Y, 

—n<s<t 


Consequently, 

E|E m) -E n) | P < C(Q)K p \v\pXp(t + n) + K p \n\ P 4 E \h-s\ p E\X^ - X^\ p 

—m<s<t 


Iterating the above inequality, we obtain 


E|X t (m) - E n) r < C(Q)K p Up[ Xp {t + n) + J2(KMk4) k (6.55) 

k=\ 

x E \ b t-s 1 \ P \K-s 2 \ p ---\bs k _ 1 ~s k \ p Xp(sk + n)Y 

—m<Sk<’-<si<t 

Since K p \^\ p (F^B p < 1 by (12.211) and sup s>1 Xp( s ) < B p < 00 , the series on the r.h.s. of 
(16.551) is bounded uniformly in m, n and tends to zero as m, n —>• 00 by the dominated 
convergence theorem. Hence, there exist Xt, t € Z such that 


lim E\X t -X^ n) \ p = 0, Vf€Z. (6.56) 

n—>00 

Note that {X t } is predictable and 

E\X t \P = lim E\X$ n) \P < °(Q) K Mp B p < C{ Pl QM p B p , 

n^oo 1 - Kplulpd^Bp 1 - K p \n\ p Lip p Q Bp 

where the last inequality follows by taking C 3 > Lip^ sufficiently close to Lipg. 

We also have by (12.221) and ([2.151) that 


t -1 

E|E bt-s(sQ(a + X s )- E btsCsQ(a + X^)\ p 

s<t s=—n 

t -1 

= E| E bt-s(sQ(a + X s )+ E bt-s(s{Q(a + X s )-Q(a + xW))\ p 

s<—n s=—n 

< K p \n\p{ E \bt-s\ p E\Q(a + X s )\ p + E |^-,| p E|g(a + X s )-Q(a + X^)| p } 

s<—n —n<s<t 

< c( e i^p+Ei^^-^r) -»• ° 

s<—n s<t 


as n —>• 00 . Whence and from (16.541) it follows that {Xt} is a stationary L p -solution of 
() 2 . 121 ) satisfying ( 12 . 221 ) . 

To show the uniqueness of stationary L p -solution of (12.121) . let {X' t },{X"} be two such 
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solutions of (12.1211 . and m p (t ) := E|X^ — X"\ p . 

Then sup te z m p (t) < M < oo and m p (t) < K p \/i\pLipQ ^2 s<t \bt- s \ p m p (s ) follows by (|2.15|1 . 
Iterating the last equation we obtain that m p (t ) < ( K p \fj,\ p LipQB p ) k M holds for all k > 1, 
where K p \p\ p LipQB p < 1 . 

Hence, m p (t ) = 0. This proves part (i) for 0 < p < 2. 

The proof of part (i) for p > 2 is analogous. Particularly, using (12.13|) as in (|6.53j) . we obtain 
E\X t \ p < K p \p\ p {^bl s E 2 / p \Q(a + X s )\ p ) p/2 

S<t 

< km p { ]T b 2 _ s (C(Q) + 4E\X S \P) 2 /P) P/2 

s<t 

< K p \tx\ p B p (C{p,Q)+ 4supE\X s \ p ) 

s€Z 

implying (1 — K p \p p \(^B p ) sup teZ E|X t | p < C(p, Q)\p\ p B p and hence the bound in (12.2211 for 
p > 2, by taking C 3 sufficiently close to Lipg. This proves part (i). 

(ii) Note that Q(x) = \fc\_ + c 2 x 2 is a Lipschitz function and satisfies (12.151) with Lipg = C 2 . 
Hence by K 2 = 1 and part (i), a unique L 2 -solution of (12.1211 under the condition 

YB 2 < 1 exists. To show the necessity of the last condition, let {Xt} be a stationary 
L 2 -solution of (12.121) . Then 

EX 2 = Y, b t-sEQ 2 (a + X s ) 

S<t 

= ^ b t-sE{cl + c 2 (a + X s ) 2 ) 

S<t 

= B 2 (ci + c 2 2 (a 2 + EX 2 )) > c 2 B 2 EX 2 

since a/0. Hence, C 2 -B 2 < 1 unless EX 2 = 0, or {X t = 0} is a trivial process. Clearly, 
( 12 . 121 ) admits a trivial solution if and only if 0 = Q(a) = \Jc\ + c 2 a 2 = 0 , or c\ = C 2 = 0 . 
This proves part (ii) and the theorem. □ 

The proofs of Proposition [5] and Theorem [TO] use the following lemma. 

Lemma 12 For a 2 > 0, j = 1, 2, ■ ■ • , denote 

Ak •— Oik T ^ ^ ^ OLi^OL\ 2 —i\ ' ' ' Oti p —i p _ 1 Qtk—i p i k — 1,2, . (6.57) 

0 <p<k 0<i-L<---<ip<k 

Assume that A := JfJLi a 'J < ^ an ^ 

otj < cT\ (3 c > 0, 7 > 1). (6.58) 

Then there exists C > 0 such that for any k > 1 

A k < Ck~ 7 . (6.59) 

Proof. We have A k = J2o< P <k A k lP , where 

A-k^p 1 = ^ ^ ^21^22—21 ’ ’ ’ ^ip—ip—i^k—ip (P — Ak f 0 := OL k 

0 <ii<---<i p <k 


17 






















is the inner sum in ([6.571) . W.l.g., assume c > 1 in (16.581) . Let us prove that there exists 
A > 0 such that 

Ak, P < c(p + 2) A A p+1 fc -7 , \/ 0 <p<k<oo. (6.60) 

Since A < 1, so (16.601) and i.P + 2) a A p+1 < oo together imply (16.591) . 

By dividing both sides of (16.601) by A p+1 , it suffices to show (16.601) for A = 1. The 
proof uses induction on p. Clearly, (16.601) holds for p = 0. To prove the induction step 
p — 1 —» p > 1, note 


Ak, P ^ ' otiA t 


i^*-k—i,p— 1 


^ ^ z,p—1 “1“ ^ ^ CX-i-A-h—i,p— 1* (6.61) 


0 <i<k 


p +1 


<i<k 


k— -^^■<k—i<k 


Here, a*l(i > ^-) < ci 7 l(i > < c(jp + 1 ) 7 fc 7 and, similarly, by the inductive 

assumption 

Ak-i,p-i±{k -i >k -2—) < c(p + l) A (/c--—r)~ 7 = c (p + 1 ) A (^2i) 7 L _7 . 

p + 1 p + 1 p ' 


Assumption A = 1 implies X]fc>o = 1 f° r an Y P > 0. Using the above facts from (16.611) 
we obtain 


„ _ c(p+ l) 7 „ , c(p+l)\p+1 A7 

Ak ’P ~ k'y 2^ + i, 7 l ^ i 


fc/(p+l)<i<fc 


kP y p 


E 




/c—fc/(p+l)</c—z<fc 


A^+ 1 \7\,.-7 


< c((p + l) 7 + (p + 1) A ( ) 7 )fc 


Hence the proof of the induction step p — 1 —>• p > 1 amounts to verifying the inequality 
(p+l) 7 + (p+l) A (2±±) 7 < (p + 2)\ or 


n 


7 + n A (——r) 7 < (n + 1) A , n = 2,3,.... 

v n — 1 


(6.62) 


The above inequality holds with A = 37 . Indeed, 


n 


7 + n A ( n ) 7 = n A (n 27 + ( ^ ) 7 ) < n A (n 2 + ( )) 

n — 1 n — 1 to — 1 


< n A (l + ^- + 2_)7< n A (1+ 3 + 3 + 1 )7 = (n + 1)A 

n — 1 n z ~ ~~ ~~ 


n n 2 n 3 ' 


proving (16.621) and the lemma, too. 


□ 


Proof of Proposition 0 We will give the proof for p > 2 only as the proof for p £ [1,2] is 
similar. 

Following the notation in ([3.381) . let {X[},{r[} be the corresponding processes (Bernoulli 
shifts) of the i.i.d. sequence £' := (■ ■ ■ , (_i, (' 0 , Ci, C 2 , ■ ■ ■) with Co replaced by its independent 
copy Co- Note that X[ = X t (t < 0), r[ = r t (t < 0). We have 5 2 (k] {X t }) = (E|Afc — 
X' k \P) 2 /P = \\X k - X' k II 2 , where 

X k -X’ k = Y, b t - s (r s -r' s ) + b k ((o-QQ(a + X 0 ). 

0 <s<k 
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Then with v 2 := ||Q(a + Xo)||p using Rosenthal’s inequality (|2.13l) similarly as in the proof 
of Theorem [4] we obtain 

\\X k -X' k \\ 2 p < ^p /p ( bl_J\r s -rX + \\(o-(o\\y k vl) 

0 <s<k 

< K 2 p /v( £ + ^)llp + 41^1^6^^ 

0<s<fc 

< Kpl p |/x|p/ p (hi P g ]T bl_s\\Xs-X% + 4blv*). 

0 <s<k 

Let a k := RTp^ p |/r|p^ p Lipg6|. Iterating the last inequality we obtain 

_ AfiP 

&2(k'i{Xt}) < . P (oik + ot s a k - s + • • •) = . P 'Ak, 

ip <3 o<s<fc ip Q 

where A*, is as in ( 16 . 571 ) . Since A = )T ) fc>0 a k = {K p y p LipQB p ) 2 / p < 1 and a k < Ck~ 2l , 
by Lemma [12] we obtain 52 (k]{Xt}) < C/c -7 , proving the first inequality in (13.4ip . The 
proof of the second inequality in (I3.4ip follows similarly using 5 2 {k\{rt}) = \\r k — r' k \\ 2 < 
Lip 2 Q ^ /p ||X fc - X'X = Up 2 Q 4 /p 5 2 (k ; {X t }). □ 

Proof of Proposition [?| We use the coupling inequality of [11 J in (16.631) . below, providing a 
simple upper bound for r-coefficients. Let {y*} be distributed as {yt} and independent of 
Vs, s < 0. Then 

k 

Tp{{yji}i<i<k) < ~ y pWp and r p( n ’{^}) - SU P \\Vj ~ Vj\\p- ( 6 - 63 ) 

i =l i^ n 

To construct the coupling for {Xt}, let {X*} be the corresponding process (Bernoulli shift) 
of the i.i.d. sequence £*:=(•■■, C-i; Co 5 Ci> ■ ■ •) with (£*, s < 0) an independent copy of 
(Cs,s < 0). Clearly, {X *} is distributed as {X t } and independent of (X s ,s < 0), the latter 
being measurable w.r.t. (C s ,s < 0). Hence, the first relation in (|3.43l) follows from 

\\X n -X * n \\ p = 0( n - 7+1 ). (6.64) 

Towards this end, consider ‘intermediate’ i.i.d. sequence £* :=(■■■ , C-i> C-i+i>''' 5 Co? 
Ci) ■••)>* > 1, Ci := £*■ Note sequences C* and £* +1 agree up to single entry. Let {X* t } be 
the corresponding Bernoulli shift of the i.i.d. sequence £*. By triangle inequality, ||X n — 
X*|| p < || X* n — X* +l n \\ p . By stationarity and Proposition [5] 

\\Xl n -X* +1 J P = 5 P (n + i,{X t }) < C(n + i)~\ (6.65) 

where 5 p is dehned in (13.381) . Clearly, (16.651) implies (16.641) . proving the first relation in 
(13.431) . Since T p (n,{r t }) < C p T p (n,{X t }), the second relation in (13.431) follows. Proposition 
[7] is proved. □ 

Proof of Theorem , [3 The first relation in (14.451) follows from (12.201) and (14.441) . The second 
relation in (14.451) follows from a general result in Abadir et al. ([I], Prop.3.1), using the 
fact that {r s } in (12.121) is a stationary ergodic martingale difference sequence. □ 
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Proof of Theorem 1 1Q[ The proof of Theorem [TO] heavily relies on the decomposition 


( r t ~ Er 2 ) - Y b t-s(r 2 s - Er 2 ) = 2 aX t + Z t , (6.66) 

S<t 

where {Zt} on the r.h.s. of (16.661) is negligible so its memory intensity is less than the 
memory intensity of the main term, {X t }. Accordingly, r 2 — Er 2 = (1 — b 2 !^)- 1 ^ 

behaves like an AR(oo) process with long memory innovations := 2 aXt + Zt ~ 2 aXt- A 
rigorous meaning to the above heuristic explanation is provided below. 

By the definition of I't in (I4.46|) . 


Zt ■= Ut + Vt , where 

U t := (C t 2 -1 )Q 2 (a + X t ), 

Vt ■= X 2 -EX 2 -^6 2 _ s (r 2 -Er 2 ) 

S<t 

= 2 Y bt-s 1 bt-s 2 r si rs2- (6.67) 

S 2 <Sl<t 


Let us hrst check that the double series in (16.671) converges in mean square and (16.671) holds. 
Let 

Xt,N • — \ bt- s r s , L. A •— 2 ^ ) bt—sibt—s2^s\i's2i 

—N<s<t —N<S 2 <si<t 


then V t . N = X 2 N - EX 2 N - 
E(X 2 N - X 2 M f = E(X t , N 


J2-N<s<t b t-s( r s ~ Er 2 ) and, for M > N, 

~ X^m) 2 {Xt,N + Xt t M ) 2 < II^JV — X tjM \\l\\X t ,N + X t , M \\l- 


By Rosenthal’s inequality in (12.131) . 

\\X t ,N + X tM \\l < C Y h2 ts ^ C alld 

—M<s<t 

\\X t , N - X t , M \\l < C Y h t-s -»• 0 (iV, M —» oo). 

—M<s<—N 

Therefore, lim^M^-oo E (X 2 7V - X 2 M ) 2 = 0. 

The convergence of EX 2 ^ and XX a r <s< t b t-s( r s ~ Er 2 ) in L 2 as N —> oo is easy. Hence, 
Vt,N, N > 1 is a Cauchy sequence in L 2 and the double series in (I6.67P converges as claimed 
above, proving (16.671) . 

Let us prove that in the decomposition (|6.66D . {Z t } is negligible in the sense that its 
(cross)covariances decay faster as the covariance of the main term, {Xj}, viz., 

E [Z t Z 0 \=oft 2d ~ 1 ), E[X t Z 0 ]=o(t 2d ~ 1 ), E[Z t X 0 ] = oft™- 1 ) (6.68) 

as t —>• oo. Note, for t > 1, E[UoUt] = E[Vo Ut] = 0 and E\VtUf\ = 26 t E[^o(Co — 1 )Q 2 (a + 
Xo) X) S2 <o b t-s 2 r s 2 ] = 0(bt) = oft™- 1 ). Hence, the first relation in (16.681) follows from 

E[V t V 0 ] = oit™- 1 ), t -> oo, (6.69) 
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which is proved below. Since E[V^ 2 ] < oo,E[Vj] = 0 we can write the orthogonal expansion 

V t = J2 PsV t , 
s<t 

where P s Vt := E[Vt|.F s ] — E[Vt|.F s _i] is the projection operator. 

By orthogonality of P s , 

\EV 0 V t \ = |^E[(P s ^o)(P s y t )]| < Y \\PsVoh\\P s Vt\\2- 

s< 0 s<0 

Relation (16.691) follows from 

\\PsVoWl = o(b 2 _ s ) = o{{-s) 2 ^), s } oo. (6.70) 

Indeed, if (16.701) is true then 

EV 0 V t = o(^(-s) d_1 (f - s) d_1 ) = o{t 2d ~ l ), t y oo, 

s<0 

proving (16.691) . 

Consider (16.701) . We have by (|6.67l) and the martingale difference property of {r s } that 

PsVq — 2 r s b— s ^ ' b— u r u 

U<S 

and 

\\PsVo\\l = 46 2 _ s E[r s 2 (^6_ u r u ) 2 ] < 4& 2 _Jr s || 2 || Yb. u r u \\\. 

u<s u<s 

By Rosenthal’s inequality in (12.131) . 

E|^6_ u r u | 4 < C4j2 b -u( Er t) 1/2 ) 2 < C{ Y « 2(d_1) ) 2 = 0(k| 2(2d ' 1} ) = o(l). 

U<s U<S u>|s| 

Therefore, 

\\PaVo\\Z < C\ S \ 2 ^ +2d ~ 1 = 0{\8\^), 

proving (16.701) . (16.691) . and the first relation in (16.681) . The remaining two relations in (16.681) 
follow easily, e.g., 

E[X t Z 0 ] = & t E[ro(Co 2 - 1 )Q 2 {a + X 0 )] + 2 b t - Sl b- Sl L Sl , 

si<0 

where 

L s i := E [r 2 si Y b -s 2 r S2 ] 

S2<Sl 

< E 1 / 2 [r s 4 jE 1 / 2 [( Y b -s 2 r S2 ) 2 ] 

S2<Sl 

= 0(( £ H n ) 1/2 ) = 0(| Sl |“-( 1 / 2 )), S, -> -oo. 

S2<S1 
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Therefore 


E [X t Z 0 \ = 0(t d ~ 1 )+ ^(t-si) d - 1 (-si) 2d - (3/2) = o(f M ” 1 ). 

.si<0 

This proves (16.681) . 

Next, let us prove (14.471) . Recall the decomposition (16.661) . Denote := 2 aX t + then 
(16.661) can be rewritten as (r 2 — E r 2 ) — b t-s( r s ~ Er^) = 6) or 

OO 

r't - Erf = teZ, (6.71) 

i =0 

where (pj > 0,j > 0 are the coefficients of the power series 

OO OO 

$0) : = X] Vj z ° = (! - 5Z G C > M < 1 

4=o 4=i 

given by <£o := 1, 

Vi ■= 6 4+ S S b 2 sAs 1 '-- b 2 s k -s k _ 1 b 2 j -s k , 3> 1- (6-72) 

0 <k<j 0<si<— <Sk<j 

From (14.441) and Lemma [T2l we infer that 

(Pi = 0(t 2d ~ 2 ), t -> oo, (6.73) 

in particular, <h(l) = = 1/(1 — -B 2 ) < 00 an< l the r.h.s. of (16.711) is well-defined. 

Relation (16.681) implies that that 

7 i := Cov(£o,£t) ~ 4a 2 Cov(Xo, Xt) ~ 4a 2 K|f 2d_1 , t —>• oo (6.74) 

with /c| = f3 2 B(d, 1 — 2d)Erg. Let us show that 

OO 

CovfoVo) = ViVjlt-i+j ~ ^ >2 (l)7ii t ^ oo. (6.75) 

i,4=o 

With (|6. 741) in mind, (|6.75l) is equivalent to 

OO 

J t ’■= X) ViVjilft-i+j ~lt) = o{t 2d ~ l ). (6.76) 

*,4=o 

For a large L > o, split J t = J[ L + J't L , where 

j't,L ■= ( Pi l Pj(jt-i+j -Jt), J"l'= <Pi<Pj(lt-i+j ~Jt)- 

i,j>0:\j-i\<L i,j>0:\j-i\>L 


Clearly, (16.761) follows from 

t l ~ 2 d J[ L = o(l) V L > 0, and lim limsup t 1 ~ 2 d J' t ' L = 0. (6.77) 

’ L-too t -to o 
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The first relation in (16.771) is immediate from (16.741) since the latter implies 7 t+k — 71 = 
o{t 2d ~ 1 ) for any k fixed. 

With (16.731) and (16.74ft in mind, the second relation in (16.771) follows from 

lim lim sup i 1-2 ^^ = 0. (6.78) 

L —>00 t _00 

where := JT j>o |j—i|>i 2 (i 2rf—1 + —*|+ d_1 ) and where k^~ l := min(l, fc 2rf_1 ), k € 

Z+- 

Split the last sum according to whether |i + j — i\ > i/2, or |i + j — *| < i/2. 

Then 


: = 


X i 2d 2 j 2d 2 {t 2d 1 + |i+ j - *| 2d *) 

*J>0:h'-i|>LT+j-«|>t/2 

< Ci 2 ^ 1 X i 2d-2 J .2d-2< Ci 2 (i -l L 2d-l 

i,j>0:\j-i\>L 

fnllnwq hv V ,;2d-2/2d-2 < y- -2d-2 -2d-2 , y- -2d-2 -2d-2 

toiiows by 2_,j,j>0 : |j-j|>L* ■? ^ 2^0<i<L/2,j>L/2 1 3 + L,i>L/2,j>0 1 3 

0(L 2d ~ 1 ). Therefore, lim^oo lim sup^^ i 1_2d J/ L = 0. 

Next, since |i + j — i\ < i/2 implies i > i/2, so with k := t + j — i we obtain 


< 

Ct 2d ~ 2 X 

j 2 d -2 (pid-l 



■i\<t /2 

< 

(jfid -2 j 2d ~ 2 

E (‘ M_1 + i*i 


j> o 

|fc|<t /2 

< 

Cf 4d - 2 , 



— * 


2 d —1 


implying lim sup^^ i 1 2 d J" L = 0 for any L > 0. This proves (16.771) . (16.761) . and (16.751) . 
Clearly, (|4.47l) follows from (I6.75jl and (16.741) . 

It remains to show the invariance principle in (14.481) . With (16.711) in mind, decompose 
Sn(r) := ES(h 2 - Er t) = £f=i 5 «(t), where 


kl 


5m(r) := 2a$(l)'£ t X t , 

t= l 
[nr] 

5 n2 (r) := 

t=i 

[nr] oo 

Sn3(r) ■.= 

t=l *=0 

Here, ES’ 2 2 (r) = o(n 2d+1 ) follows from (16.681) . Consider 

[nr] oo [nr] 

E ^(r) := X X] -6) = X 

t,s=l i,j= 0 t,s=l 

where p t := £°° =0 VWji'it+j-i ~ 7t+j ~ 7t-i + 7t) = o(i 2d ” 1 ) follows similarly to (16.77ft . 
Hence, S n i(r ) = o p (n _rf ~ 1 / 2 ), i = 2,3. The convergence n d 1 ^ 2 5’ n i(r) —^Z)[0,1] K 2-Sd+(l/2)( r ) 
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follows from Theorem [9j 

This completes the proof of Theorem [lOj 


□ 


hi = lim E 


Proof of Theorem{TJ\ Let us first prove that ||h|| < oo. Note that 

lim E( V' b t - s r s ) 2 r 0 = E( V' 6 t _ s r s ) 2 r 0 , (6.79) 

n—>oo x z —' 7 x z —^ 7 

—n<s<t —oo <s<t 

which follows from the definition of Resolution of (14.46|) and Remark [2j Then using (16.79ft . 
Erf = E[r 3 ] = E[rjr s ] = 0, s < t we obtain 

c" + a 2 + 2 a b t - s r s + ^ b 2 

—n<s<t 

T 2 'y ^ bt— sl bi : — S2 r sl r S 2 S jrt—j 

—n<S2<si<t 

= 2am^bj + bf_ s hj+ s -t + 26 7 - lim ER n (t,j) (6.80) 

t—j <s<t 

where R n (t,j ) := r 2 _j Yf n <s<t j b t-s r s ■ Using Holder’s and Rosenthal’s (12.131) inequalities 
we obtain 


>2 2 
\-s r s 


—n<s<t 


\ER n (t,j)\ < E 2/3 |r t _j| 3 E 1/3 | Y, 

—n<s<t—j 

< E|r 0 | 3 R 3 ( b "s ) 3/2 ^ a 


—n<s<t—j 


Hence, 


N < C\bj\ + £ bj_i\hi\ < C(\bj \ + ]T 

0 <i<j 0 <i<j 


(6.81) 


(6.82) 


where the first inequality in (I6.82P follows from (16.80ft and (I6.8ip and the second inequality 
in (16.821) by iterating the first one with <pj as in (16.721) . Since Yl'jLiVj = i^ 2k = 
1/(1 — B 2 ) 1 from the second inequality in (16.821) we obtain ||fo|| < CB/( 1 — B 2 ) < oo. The 
last fact implies E R n (t,j) = Yl%=\ J hih+j —> From (16.801) we obtain that the 

leverage function h E R 2 (Z + ) is a solution of the linear equation: 

hj = 2abjrri2 + 22 bjhj-i + 2 bj 22 b i+jhi, j = 1, 2, • • • . (6.83) 

0 <i<j i> 0 

From Minkowski’s inequality, we get X^>o(Eo<i<j b i h j-i ) 2 < -B 4 |N| 2 , J2j >o ( b j E,:>o b i+j h iY 
< R 4 ||/i|| 2 and then (I6.83P implies that ||/i|| < 2|a|m2-E> + 3R 2 ||h||, or 

2|a|m2-B 


< 


(6.84) 


1-3R 2 

provided B 2 < 1/3. 

Let us prove the statements (i) and (ii) of Theorem 1111 for k = 1. From (|6.83l) it follows 
that 

OO OO 

hi = 2am 2 bi + 2bi'22h u bi +u = 2bi(am 2 + h u bi +u ) 


U= 1 


U= 1 
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Since \^2^Lih u bi +u \ < ||/i||.B, we have sgn(/ii) = sgn(&ia) provided \\h\\B < \a\m 2 holds. 
The last relation follows from (I6.84P and B 2 <1/5; indeed, 


II h\\B< 


2|a|m2-B 2 
1-3 B 2 


< |a|m2- 


This proves (i) and (ii) for k = 1. 

The general case k > 1 follows similarly by induction on k. Indeed, from (|6.83l) we have 
that 


00 k—1 

hk ~ 2&^(tt777/2 + ^ ^ “1“ ^ ^ b k _jhj . 

u =1 j=l 

Assume hi, ■ ■ • , h^-i < 0, then the second term X<=i < 0. Moreover, 

OO 

| ^2 h u bk+u\ < \\h\\B < \a\m 2 

U= 1 


implying that the sign of the first term is the same as sgn(o6fe). 

Theorem ED is proved. □ 
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